Information-theoretical analysis of the statistical dependencies among three variables: Applications to written language
Abstract
We develop the information-theoretical concepts required to study the statistical dependencies among three variables. Some of these dependencies are pure triple interactions, in the sense that they cannot be explained as a combination of pairwise correlations. We derive bounds for triple dependencies and characterize the shape of the joint probability distribution of three binary variables with high triple interaction. The analysis also allows us to quantify the amount of redundancy in the mutual information between pairs of variables, and to assess whether the information between two variables is mediated by a third variable. These concepts are applied to the analysis of written texts. We find that the probability that a given word is found in a particular location within the text is modulated not only by the presence or absence of other nearby words, but also by the presence or absence of nearby pairs of words. We identify the words enclosing the key semantic concepts of the text, the triplets of words with high pairwise and triple interactions, and the words that mediate the pairwise interactions between other words.
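To make the notion of a pure triple interaction concrete, the following is a minimal sketch in Python. It assumes the common textbook definition of interaction information, I(X;Y|Z) - I(X;Y), with the sign convention that positive values indicate synergy; the abstract does not state the paper's exact definition or sign convention, so both are assumptions here. The function names and the two toy distributions (`xor` and `copy`) are illustrative and do not come from the paper.

```python
from itertools import product
from math import log2


def entropy(p, axes):
    """Shannon entropy (in bits) of the marginal of p over the variable indices in axes."""
    marginal = {}
    for outcome, prob in p.items():
        key = tuple(outcome[i] for i in axes)
        marginal[key] = marginal.get(key, 0.0) + prob
    return -sum(q * log2(q) for q in marginal.values() if q > 0)


def mutual_information(p, a, b):
    """I(A;B) = H(A) + H(B) - H(A,B)."""
    return entropy(p, [a]) + entropy(p, [b]) - entropy(p, [a, b])


def conditional_mutual_information(p, a, b, c):
    """I(A;B|C) = H(A,C) + H(B,C) - H(A,B,C) - H(C)."""
    return (entropy(p, [a, c]) + entropy(p, [b, c])
            - entropy(p, [a, b, c]) - entropy(p, [c]))


def interaction_information(p):
    """Assumed convention: I(X;Y|Z) - I(X;Y); positive means synergy, negative redundancy."""
    return conditional_mutual_information(p, 0, 1, 2) - mutual_information(p, 0, 1)


# Toy distribution 1: x and y are independent fair bits and z = x XOR y.
# Every pair of variables is independent, yet the three together are fully dependent.
xor = {(x, y, x ^ y): 0.25 for x, y in product((0, 1), repeat=2)}

# Toy distribution 2: all three variables are copies of one fair bit (pure redundancy).
copy = {(0, 0, 0): 0.5, (1, 1, 1): 0.5}

for name, dist in (("xor", xor), ("copy", copy)):
    pairwise = [mutual_information(dist, a, b) for a, b in ((0, 1), (0, 2), (1, 2))]
    print(name, "pairwise MI:", pairwise, "interaction:", interaction_information(dist))
```

In the XOR case all pairwise mutual informations vanish while the interaction term is one bit, which is the kind of dependency that cannot be explained by pairwise correlations. In the copy case the pairwise informations are fully redundant and the interaction term is negative under this convention.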
Journal title:
- Physical review. E, Statistical, nonlinear, and soft matter physics
Volume 92, Issue 2
Pages -
Publication date 2015